Adaptive Communication Mode For Recording A Media Message

ABSTRACT

A computer implemented method and system for determining a communication mode for recording a media message on a communication device is provided. An adaptive recording application detects activation of an interface element, for example, a push or a touch of a button on the communication device and determines duration of the activation of the interface element. The adaptive recording application, in response to the detection of the activation of the interface element, selects a communication mode, for example, a push to talk communication mode, a tap to start communication mode, etc., based on a comparison of the duration of the activation of the interface element with one or more configurable timing parameters and/or a determination of a presence or an absence of a media signal during and/or after the activation of the interface element. The adaptive recording application initiates recording of the media message in the selected communication mode.

CROSS REFERENCE TO RELATED APPLICATIONS

This is a continuation application of non-provisional patent applicationSer. No. 16/556,271 titled “Adaptive Communication Mode For Recording AMedia Message”, filed in the United States Patent and Trademark Officeon Aug. 30, 2019, which is a continuation application of non-provisionalpatent application Ser. No. 15/892,379 titled “Adaptive CommunicationMode For Recording A Media Message”, filed in the United States Patentand Trademark Office on Feb. 8, 2018, which is a continuationapplication of non-provisional patent application Ser. No. 14/608,193titled “Adaptive Communication Mode For Recording A Media Message”,filed in the United States Patent and Trademark Office on Jan. 29, 2015,which is a continuation application of non-provisional patentapplication Ser. No. 13/945,278 titled “Adaptive Communication Mode ForRecording A Media Message”, filed in the United States Patent andTrademark Office on Jul. 18, 2013, which claims the benefit ofprovisional patent application No. 61/673,728 titled “AdaptiveCommunication Mode For Recording An Audio Message”, filed in the UnitedStates Patent and Trademark Office on Jul. 19, 2012.

The specifications of the above referenced patent applications areincorporated herein by reference in their entirety.

BACKGROUND

Prior to the age of smart phones and mobile tablets, the need forrecording an audio message, a video message, or any other media messagewhile on the move was not high. One method to record an audio messagewas to make a phone call, speak on the phone, and record a user's voiceon the other end of the line, for example, when using answering machinesand voicemails, or when speaking on a recorded line. However, astechnology developed and smart phones became popular, many smart phoneapplications came up with the need to record audio messages and othermedia messages. For example, two communication modes for recording audiowere developed, that is, a push to talk (PTT) communication mode and atap to start (TTS) communication mode.

In the push to talk communication mode, a user pushes a button on theuser's communication device, and keeps the button pressed. The userrecords his or her message while the button is pushed down. Upon releaseof the button, the communication device terminates recording and savesthe message. This method provides the user with one click access torecording a short media message. The other communication mode, that is,the tap to start communication mode, requires a user to first click astart button or an initiation button on the communication device tostart the recording process. In this case, the user clicks, that is,taps and quickly releases the button, at which time the communicationdevice initiates recording of the media message. The user then has toclick a stop button on the communication device to stop the recording.After clicking the stop button, the communication device terminates therecording and saves the recorded media message. There is also avariation of the tap to start communication mode, where a user is notrequired to click the stop button on the communication device. Instead,the communication device performs voice activity detection to determinewhen the user has stopped speaking, and at this time the communicationdevice automatically terminates the recording and saves the audiomessage.

The push to talk communication mode allows a user to record a shortaudio message quickly, with a single press of a button on thecommunication device; that is, all the user is required to do is pressthe button and then release the button. However, in the push to talkcommunication mode, the user finds it difficult to record a long audiomessage. The user is forced to interact with the communication devicethroughout the recording of the audio message, which makes the processfor recording long audio messages inconvenient. While recording a longaudio message, the user's finger may inadvertently slip off the “push totalk” button on the communication device, which would indicate to thecommunication device that the user wants to terminate the recording,thereby causing the communication device to abruptly terminate therecording of the audio message even though the user intended to continuerecording the audio message. It is also not feasible for a user, whilepressing a button to place a smart phone at his/her ear to speak,forcing him/her to use a speakerphone.

The tap to start communication mode can be convenient for recording longaudio messages. The tap to start communication mode allows the user totap a button on the communication device, and have both his/her handsfree while recording an audio message, for example, a one minute audiomessage, a song being heard at a concert, etc. The tap to startcommunication mode allows a user to have minimal or no friction whilerecording a long audio message. That is, tapping a single button on thecommunication device to initiate the recording and then tapping a buttonagain only to terminate the recording allows the user to not have tointeract with the communication device while recording the audiomessage. However, in the tap to start communication mode, it isinconvenient to record short audio messages. Since the tap to startcommunication mode requires the use of two user interactions, it isdifficult for the user to record a short audio message. A short audiomessage, for example, “Hi!” would require the user to first tap the “tapto start” button on the communication device to initiate recording, thenspeak “Hi!”, and then immediately tap the same button again or adifferent button to terminate the recording.

In some solutions, options are provided in a “Settings” menu to allow auser to select which communication mode he or she would prefer using forrecording a media message. However, the user would still have theinconvenience of frequently switching between these options. Consider anexample where a user is in the middle of a lengthy back and forthconversation with one of his/her friends, and is simultaneously asking afew other friends if they are available to go to the movies. In order toevade the problems created by the tap to start communication mode andthe push to talk communication mode, the user would have to continuouslyinterrupt recording of audio messages by navigating to the “Settings”menu on the user's communication device and switching the communicationmode to the communication mode the user would like to use for the nextinteraction.

Hence, there is a long felt but unresolved need for a computerimplemented method and system that determines a communication mode forrecording a media message, for example, an audio message, a videomessage, an audiovisual message, a multimedia message, etc., on a user'communication device, by automatically deducing the user's intent whilerecording the media message.

SUMMARY OF THE INVENTION

This summary is provided to introduce a selection of concepts in asimplified form that are further disclosed in the detailed descriptionof the invention. This summary is not intended to identify key oressential inventive concepts of the claimed subject matter, nor is itintended for determining the scope of the claimed subject matter.

The computer implemented method and system disclosed herein addressesthe above stated need for determining a communication mode for recordinga media message on a user' communication device, by automaticallydeducing the user's intent while recording the media message. As usedherein, the term “media message” refers to a message that can berecorded on a user's communication device, for example, a cellularphone, a smart phone, a tablet computing device, a personal digitalassistant, a laptop, a touch centric device, etc., or any other mobiledevice configured for mobile communication. The media message is, forexample, an audio message, a video message, an audiovisual message, amultimedia message, etc., and any combination thereof. Also, as usedherein, the term “communication mode” refers to a mode for initiatingand terminating recording of a media message on a communication device.The communication mode can be, for example, a push to talk (PTT)communication mode or a tap to start (TTS) communication mode. Thecomputer implemented method and system disclosed herein provides anadaptive recording application executable by at least one processorconfigured to record the media message on the user's communicationdevice.

The adaptive recording application detects activation of an interfaceelement on the communication device for initiating the recording of themedia message. As used herein, the term “interface element” refers toany software or hardware implemented switch or button, for example, aphysical push button, a touch button or a tactile button, a screen icontype touch button, a wheel, a touch pad, etc., capable of receiving aninput from a user or capturing an application of pressure from the user.The adaptive recording application determines a duration of theactivation of the interface element. As used herein, the term “durationof activation” refers to a measure of time during which the user pressesand holds the interface element. That is, the duration of activation isthe measure of the length of time of a user's click on an interfaceelement. The adaptive recording application compares the duration of theactivation of the interface element with one or more configurable timingparameters. As used herein, the term “configurable timing parameters”refers to time parameters used by the adaptive recording application forcomparison with the duration of activation of an interface element toenable the adaptive recording application to select one of thecommunication modes for recording the media message. In an embodiment,the adaptive recording application configures the configurable timingparameters, for example, based on a user input or a system settingconfigured by the adaptive recording application, or through an analysisof the duration of the activation of the interface element, and/or thebehavior of the user determined from the activation of the interfaceelement by the user in the past, or during a training session. Theadaptive recording application selects, in response to the detection ofthe activation of the interface element, one of the communication modes,for example, the push to talk communication mode or the tap to startcommunication mode for recording the media message based on thecomparison of the duration of the activation of the interface elementwith one or more configurable timing parameters.

In an embodiment, the configurable timing parameters comprise a criticaltime. As used herein, the term “critical time” refers to a timeconfigured, for example, by a user input, a system setting, etc. In thisembodiment, the adaptive recording application compares duration of theactivation of the interface element with the critical time. The adaptiverecording application selects, in response to the detection of theactivation of the interface element, one of the communication modesbased on the comparison of the duration of the activation of theinterface element with the critical time. For example, if the durationof the activation of the interface element exceeds the critical time,the adaptive recording application selects the push to talkcommunication mode for recording the media message, and if the durationof the activation of the interface element is less than the criticaltime, the adaptive recording application selects the tap to startcommunication mode for recording the media message. The adaptiverecording application triggers recording of the media message on thecommunication device using the selected communication mode.

In an embodiment, the computer implemented method and system disclosedherein determines a communication mode for recording a media message ona user's communication device on determining a presence or an absence ofa media signal during and/or after the activation of the interfaceelement. As used herein, the term “media signal” refers to a signalcomprising media, for example, audio, video, etc., that can be detectedby the adaptive recording application to enable the adaptive recordingapplication to select a communication mode for recording a mediamessage. The media signal is, for example, an audio signal, a videosignal, an audiovisual signal, a multimedia signal, etc. In thisembodiment, on detection of the activation of the interface element onthe communication device, the adaptive recording application determinesa presence or an absence of a media signal during and/or after theactivation of the interface element. For example, the adaptive recordingapplication determines a presence of an audio signal by listening to anutterance of a user or other audible sounds such as music, naturalsounds such as birds chirping, etc., during the activation of theinterface element, or after the activation of the interface element, orduring and after the activation of the interface element. In anotherexample, the adaptive recording application determines an absence of anaudio signal by listening for a lack of an utterance, that is, silenceof the user and a lack of other audible sounds such as music, naturalsounds such as birds chirping, etc., during the activation of theinterface element, or after the activation of the interface element, orduring and after the activation of the interface element. The adaptiverecording application selects, in response to the detection of theactivation of the interface element, one of the communication modes forrecording the media message based on the determination of the presenceor the absence of the media signal. The adaptive recording applicationthen triggers recording of the media message on the communication deviceusing the selected communication mode. In an embodiment, the adaptiverecording application analyzes the media signal received during and/orafter the activation of the interface element for selecting one of thecommunication modes. In an embodiment, the adaptive recordingapplication initiates the recording of the media message prior to theselection of one of the communication modes. In another embodiment, theadaptive recording application initiates the recording of the mediamessage after the selection of one of the communication modes.

In another embodiment, the computer implemented method and systemdisclosed herein determines a communication mode for recording a mediamessage on a communication device based on a combination of thecomparison of the duration of the activation of the interface elementwith one or more configurable timing parameters and the determination ofa presence or an absence of a media signal during and/or after theactivation of the interface element. In this embodiment, the adaptiverecording application selects, in response to the detection of theactivation of the interface element, one of the communication modes forrecording the media message based on the combination.

In another embodiment, the configurable timing parameters comprise alower threshold time and an upper threshold time. As used herein, theterm “lower threshold time” refers to a time before which it isexplicitly clear that the user's intention is to use the tap to startcommunication mode based on the fact that the interface element is helddown for a short time. Also, as used herein, the term “upper thresholdtime” refers to a time after which it is explicitly clear that the userintends to use the push to talk communication mode based the fact thatthe user is holding the interface element down for a long time. In thisembodiment, the adaptive recording application compares the duration ofthe activation of the interface element with the lower threshold timeand the upper threshold time. The adaptive recording applicationselects, in response to the detection of the activation of the interfaceelement, one of the communication modes based on the comparison of theduration of the activation of the interface element with the lowerthreshold time and the upper threshold time. For example, if theduration of the activation of the interface element is less than thelower threshold time, the adaptive recording application selects the tapto start communication mode for recording a media message, and if theduration of the activation of the interface element is greater than theupper threshold time, the adaptive recording application selects thepush to talk communication mode for recording the media message.

In an embodiment, if the duration of the activation of the interfaceelement is greater than the lower threshold time and less than the upperthreshold time, the adaptive recording application selects one of thecommunication modes based on one or more of multiple options comprising,for example, a presence or an absence of a media signal, proximity ofthe duration of the activation of the interface element to the lowerthreshold time or the upper threshold time, type of the user'scommunication device, a user input clarifying the communication modeintended by the user, etc.

The computer implemented method and system disclosed herein detects theuser's intent while the user clicks on an interface element on theuser's communication device to record a media message. Since theadaptive recording application can detect what the user is intending todo, and can adapt to the user's needs, the computer implemented methodand system disclosed herein provides the user with a friction-freemethod for recording a media message in a communication mode that isconvenient and natural for the interaction the user has in mind, withoutthe need to manually switch between the communication modes.

BRIEF DESCRIPTION OF THE DRAWINGS

The foregoing summary, as well as the following detailed description ofthe invention, is better understood when read in conjunction with theappended drawings. For the purpose of illustrating the invention,exemplary constructions of the invention are shown in the drawings.However, the invention is not limited to the specific methods andcomponents disclosed herein.

FIG. 1A exemplarily illustrates a computer implemented method fordetermining a communication mode for recording a media message on acommunication device.

FIG. 1B exemplarily illustrates an embodiment of the computerimplemented method for determining a communication mode for recording amedia message on a communication device.

FIG. 1C exemplarily illustrates another embodiment of the computerimplemented method for determining a communication mode for recording amedia message on a communication device.

FIG. 2 exemplarily illustrates a flowchart comprising the steps fordetermining a communication mode for recording an audio message on acommunication device based on a comparison of a duration of anactivation of an interface element with a critical time.

FIG. 3 exemplarily illustrates a flowchart comprising the steps fordetermining a communication mode for recording an audio message on acommunication device based on a determination of a presence or anabsence of an audio signal.

FIG. 4 exemplarily illustrates a timing diagram showing a lowerthreshold time and an upper threshold time configured for determining acommunication mode for recording a media message on a communicationdevice.

FIG. 5 exemplarily illustrates a flow diagram comprising the steps forconfiguring a lower threshold time for determining a communication modefor recording a media message on a communication device.

FIG. 6 exemplarily illustrates a flow diagram comprising the steps forconfiguring an upper threshold time for determining a communication modefor recording a media message on a communication device.

FIGS. 7A-7C exemplarily illustrate a flowchart comprising the steps fordetermining a communication mode based on a comparison of the durationof the activation of the interface element with a lower threshold timeand an upper threshold time.

FIG. 8 exemplarily illustrates a computer implemented system fordetermining a communication mode for recording a media message on acommunication device.

FIG. 9 exemplarily illustrates the architecture of a computer systememployed by an adaptive recording application for determining acommunication mode for recording a media message on a communicationdevice.

DETAILED DESCRIPTION OF THE INVENTION

FIG. 1A exemplarily illustrates a computer implemented method fordetermining a communication mode for recording a media message on acommunication device. As used herein, the term “media message” refers toa message that can be recorded on a user's communication device, forexample, a cellular phone, a smart phone, a tablet computing device, anUltrabook®, a laptop, a personal digital assistant, a touch centricdevice, etc., or any other mobile device configured for mobilecommunication. The media message is, for example, an audio message, avideo message, an audiovisual message, a multimedia message, etc., andany combination thereof. Also, as used herein, the term “communicationmode” refers to a mode for initiating and terminating recording of amedia message on a communication device. The communication mode can be,for example, a push to talk (PTT) communication mode, a tap to start(TTS) communication mode, etc. The communication device may be used fortransmitting the recorded media message to another communication devicevia a network, for example, a mobile communication network. In the pushto talk communication mode, a user is expected to engage an interfaceelement for the entire duration of recording of a media message. Thatis, the recording initiates as soon as the user activates the interfaceelement and terminates only when the user releases the interfaceelement. As used herein, the term “interface element” refers to anysoftware or hardware implemented switch or button, for example, aphysical push button, a touch button or a tactile button, a screen icontype touch button, a wheel, a touch pad, etc., capable of receiving aninput from a user or capturing an application of pressure from the user.In the tap to start communication mode, the user is expected to activatethe interface element for a short time, which initiates the recording ofa media message, then to release the interface element, then speak, andthen activate the interface element a second time to terminate therecording of the media message.

The computer implemented method disclosed herein provides 101 anadaptive recording application executable by at least one processorconfigured to record the media message on the communication device. Theadaptive recording application detects 102 activation of an interfaceelement on the user's communication device for initiating the recordingof the media message. For example, the adaptive recording applicationdetects a click, a touch, a press, a push, a tap, etc., of a screen icontype touch button or a physical push button on the communication deviceby the user for initiating recording of the media message. After theinterface element is activated, the adaptive recording applicationselects the communication mode and initiates recording of the mediamessage of the user in the selected communication mode. The method fordetermining the communication mode for recording the media message isindependent of the type of the interface element being activated.

The adaptive recording application determines 103 a duration of theactivation of the interface element. As used herein, the term “durationof activation” refers to a measure of time during which the user pressesand holds the interface element. For example, the adaptive recordingapplication measures the length of time of a user's click on aninterface element, or the time for which the interface element ispressed. The adaptive recording application compares 104 the duration ofthe activation of the interface element with one or more configurabletiming parameters. As used herein, the term “configurable timingparameters” refers to time parameters used by the adaptive recordingapplication for comparison with the duration of activation of aninterface element to enable the adaptive recording application to selectone of the communication modes for recording the media message. In anembodiment, the adaptive recording application configures theconfigurable timing parameters, for example, based on a user input or asystem setting configured by the adaptive recording application, orthrough an analysis of the duration of the activation of the interfaceelement by the user in the past, and/or the behavior of the userdetermined from the activation of the interface element by the user inthe past, and/or during user training as disclosed in the detaileddescription of FIGS. 5-6. That is, the configurable timing parameterscan be fixed as a system settable parameter, as a user settableparameter, or may vary and be calculated based on one or more of anumber of factors, for example, type of the communication device,characteristics of the user, user actions, past user interactions,language, etc. In an embodiment, the configurable timing parameterscomprise a critical time. In another embodiment, the configurable timingparameters comprise a lower threshold time, an upper threshold time,etc. The adaptive recording application can also learn and change theconfigurable timing parameters based on the user's behavior with theadaptive recording application.

The adaptive recording application selects 105, in response to thedetection of the activation of the interface element, one of thecommunication modes for recording the media message based on thecomparison of the duration of the activation of the interface elementwith one or more configurable timing parameters. The adaptive recordingapplication selects, for example, the tap to start communication mode ifthe duration of the activation of the interface element is less than thecritical time. The adaptive recording application selects the push totalk communication mode if the duration of the activation of theinterface element is greater than the critical time. The adaptiverecording application triggers 106 the recording of the media message inthe selected communication mode. Once the communication mode is selectedby the adaptive recording application, the recording of the mediamessage continues using the selected communication mode.

FIG. 1B exemplarily illustrates an embodiment of the computerimplemented method for determining a communication mode for recording amedia message on a communication device. The computer implemented methoddisclosed herein provides 101 the adaptive recording application asdisclosed in the detailed description of FIG. 1A. The adaptive recordingapplication enables recording of the media message using thecommunication mode intended by the user and determined by the adaptiverecording application. The adaptive recording application detects 102activation of an interface element on the user's communication devicefor initiating the recording of the media message.

In this embodiment, on detection of the activation of the interfaceelement, the adaptive recording application determines 107 a presence oran absence of a media signal during and/or after the activation of theinterface element. As used herein, the term “media signal” refers to asignal comprising media, for example, audio, video, etc., that can bedetected by the adaptive recording application to enable the adaptiverecording application to select a communication mode for recording amedia message. The media signal is, for example, an audio signal, avideo signal, an audiovisual signal, a multimedia signal, etc. The mediasignals comprise, for example, voice signals received from the user whenthe user speaks into a microphone of the communication device andaudible sounds such as music, natural sounds such as birds chirping,etc., that are heard for recording an audio message. The adaptiverecording application, while detecting an activation of the interfaceelement, determines a presence or an absence of a media signal. Forexample, the adaptive recording application, while detecting a click ofa button on the communication device, determines a presence of an audiosignal by listening to an utterance of the user, or to other audiblesounds. In another example, the adaptive recording application, whiledetecting the click of a button on the communication device, determinesan absence of an audio signal by listening for a lack of an utterance,that is, silence of the user, lack of audible sounds such as music, lackof natural sounds such as birds chirping, etc., that the user intends torecord. The adaptive recording application, after detecting anactivation of the interface element, determines a presence or an absenceof a media signal. Also, for example, the adaptive recordingapplication, after detecting the click of a button on the communicationdevice, determines a presence of an audio signal by listening to anutterance of the user, other audible sounds, etc. In another example,the adaptive recording application, after detecting the click of abutton on the communication device, determines an absence of an audiosignal by listening for a lack of an utterance, that is, silence of theuser, a lack of audible sounds such as music, a lack of natural soundssuch as birds chirping, etc., that the user intends to record. Theadaptive recording application performs the processing and the detectionof the media signal using hardware and/or software. In an embodiment,the adaptive recording application determines a presence or an absenceof a media signal during and after the activation of the interfaceelement. In another embodiment, the adaptive recording applicationdetermines a presence or an absence of a media signal during and/orafter deactivation of the interface element. For example, the adaptiverecording application determines the presence or the absence of theaudio signal during and/or after the release of a button on thecommunication device.

The adaptive recording application selects 108, in response to thedetection of the activation of the interface element, one of thecommunication modes for recording the media message based on thedetermination 107 of the presence or the absence of the media signal.The adaptive recording application deduces the intent of the user by acombination of the activation of the interface element and the presenceor the absence of an utterance of audio by the user, other audiblesounds, etc. If the activation of the interface element occurs alongwith the determination of a presence of a media signal concurrently orsoon thereafter, then the adaptive recording application may select thepush to talk communication mode. If the activation of the user interfaceelement occurs without a subsequent determination of a presence of amedia signal, then the adaptive recording application may select the tapto start communication mode. In another example, if the activation ofthe interface element occurs along with the subsequent determination ofan absence of a media signal, then the adaptive recording applicationmay select the tap to start communication mode.

In an embodiment, the adaptive recording application analyzes the mediasignal received during and/or after the activation of the interfaceelement for selecting one of the communication modes. During theanalysis of the media signal, the adaptive recording applicationdistinguishes between multiple sounds, for example, voice signals,audible sounds such as music, natural sounds such as birds chirping,etc., contained in the media signal. For example, if an audio signalcontains noise or background chatter, the adaptive recordingapplication, for the purpose of determining the communication mode andthe user's intent to record the audio signal without noise or backgroundchatter, eliminates the noise or background chatter with assistance fromnoise cancellation algorithms that are well known in the art. Noisecancellation may be used to remove background noise to better analyzethe audio signal. In another example, if the audio signal contains noiseor background chatter, the adaptive recording application, for purposeof determining the communication mode and the user's intent to recordthe audio signal along with noise or background chatter, does noteliminate the noise or background chatter.

Although the detailed description refers to presence or absencedetermination and analysis of an audio signal, the scope of the computerimplemented method and system disclosed herein is not limited topresence or absence determination and analysis of an audio signal butmay be extended to include presence or absence determination andanalysis of any media signal, for example, a video signal, anaudiovisual signal, a multimedia signal, etc.

The adaptive recording application triggers 106 the recording of themedia message using the selected communication mode. Once thecommunication mode is selected by the adaptive recording application,the recording of the media message continues in the selectedcommunication mode. The analysis of the media signal by the adaptiverecording application also determines whether the recording of the mediamessage has ended. If the recording of the media message has ended, theadaptive recording application automatically terminates the recording ofthe media message. In an embodiment, the adaptive recording applicationinitiates recording of the media message prior to the selection of thecommunication mode. For example, the recording of an audio messagestarts before the selection of the push to talk communication mode orthe tap to start communication mode. In another embodiment, the adaptiverecording application initiates recording of the media message after theselection of the communication mode. For example, the recording of anaudio message starts after the selection of the push to talkcommunication mode or the tap to start communication mode. The methodfor terminating the recording is based on the selection of thecommunication mode. For example, in the push to talk communication mode,the user presses an initiation button on a communication device andholds the initiation button for recording the audio message. The userreleases the initiation button after recording the audio message. Theadaptive recording application terminates the recording of the audiomessage on detecting the release of the initiation button. In anotherexample, in the tap to start communication mode, the user taps orpresses an initiation button and releases the initiation button to startrecording of the audio message. The user records the audio message andtaps on the initiation button again or taps a different button aftercompletion of recording of the audio message. In this example, theadaptive recording application terminates the recording of the audiomessage on detecting the second tap on the initiation button.

FIG. 1C exemplarily illustrates another embodiment of the computerimplemented method for determining a communication mode for recording amedia message on a communication device. The computer implemented methoddisclosed herein provides 101 the adaptive recording application asdisclosed in the detailed description of FIG. 1A. The adaptive recordingapplication detects 102 activation of an interface element on the user'scommunication device for initiating the recording of the media messageas disclosed in the detailed description of FIG. 1A. The adaptiverecording application determines 103 a duration of the activation of theinterface element as disclosed in the detailed description of FIG. 1A.In this embodiment, the adaptive recording application selects 109, inresponse to the detection of the activation of the interface element,one of the communication modes for recording the media message based ona combination of the comparison 104 of the duration of the activation ofthe interface element with one or more configurable time parameters asdisclosed in the detailed description of FIG. 1A, and the determination107 of a presence or an absence of a media signal during and/or afterthe activation of the interface element as disclosed in the detaileddescription of FIG. 1B. The adaptive recording application determinesthe presence or the absence of a media signal during the activation ofthe interface element, or after the activation of the interface element,or during and after the activation of the interface element. Theadaptive recording application triggers 106 the recording of the mediamessage using the selected communication mode. Once the communicationmode is selected by the adaptive recording application, the recording ofthe media message continues in the selected communication mode.

For purposes of illustration, the detailed description refers torecording of an audio message; however the scope of the computerimplemented method and system disclosed herein is not limited torecording of an audio message but may be extended to include recordingof a video message or a video clip, an audiovisual message, a multimediamessage, etc., or any combination thereof.

FIG. 2 exemplarily illustrates a flowchart comprising the steps fordetermining a communication mode for recording an audio message on acommunication device based on a comparison of a duration of anactivation of an interface element with a critical time. In thisembodiment, the configurable timing parameter is the critical time. Thecritical time can be fixed as a system settable parameter, as a usersettable parameter, or may vary and be calculated based on one or moreof a number of factors, for example, type of the communication device,characteristics of the user, user actions, past user interactions,language, etc. The adaptive recording application can also learn andchange the critical time based on the user's behavior with the adaptiverecording application. In an example, the adaptive recording applicationstarts with a critical time of 0.3 seconds, but determines that the useroften keeps the interface element on the communication device pressedfor about 0.5 seconds before releasing the interface element, and thenstarts speaking; indicating that a critical time of 0.5 seconds may bemore suitable for the user. The adaptive recording application thereforelearns and changes the critical time to 0.5 seconds for that particularuser. In another example, the critical time may be longer in a slowercommunication device as compared to a newer and faster communicationdevice.

As exemplarily illustrated in FIG. 2, the adaptive recording applicationprovides the user with a default screen 201 showing recording controlson the communication device. The adaptive recording application detectsan activation of the interface element, for example, when the userclicks 202 or presses and holds an initiation button and releases theinitiation button after a certain duration of time. The adaptiverecording application initiates 203 recording of an audio message andmeasures the duration of activation or click of the interface element.The adaptive recording application then compares 204 the duration of theactivation of the interface element with the critical time. If theduration of the activation of the interface element is greater than thecritical time, the adaptive recording application selects the push totalk communication mode 205. If the duration of the activation of theinterface element is less than the critical time, the adaptive recordingapplication selects the tap to start communication mode 212. Theadaptive recording application completes recording the audio messageusing one of the selected communication modes. After selection of thecommunication mode, the adaptive recording application provides anindication 206 to the user to start talking. As the user talks 207, theadaptive recording application detects an end of the recording when theuser lets go off or releases 208 the initiation button or when the userstops talking 214 or taps a stop button 213, and triggers termination209 of the recording. The adaptive recording application saves 210 therecorded audio message and provides an indication 211 to the user thatthe recorded audio message is saved. This embodiment eliminates the stepof having to decide and manually switch between the communication modeswhile trying to record an audio message on a communication device, andallows the user to seamlessly use either of the communication modes forany utterance or audible sounds.

The computer implemented method disclosed herein allows the user torecord and transmit voice messages to his/her friends, without worryingabout the communication mode to use. The user can conveniently andseamlessly use either of the communication modes, for example, the pushto talk communication mode or the tap to start communication mode, withthe adaptive recording application performing the operation of deducingwhich communication mode is intended to be used by the user, andadapting the recording of the audio message automatically for thatcommunication mode. Consider an example where the adaptive recordingapplication configures the critical time to 100 milliseconds. If theuser taps the interface element on the communication device for a timeshorter than the critical time, the adaptive recording applicationdeduces that the user intends to use the tap to start communication modeand invokes a tap to start screen on the communication device forprompting the user to start recording an audio message. The user maythen record a lengthy audio message and then press a stop button on thecommunication device, which will save and send the user's audio message.

The user may then wish to invite his/her other friends to go to themovies, for example, by sending them a short 3 second audio message. Theuser would only need to press down on the initiation button and keep theinitiation button pressed to record the short audio message. After thecritical time passes and the user's finger is still on the initiationbutton, the adaptive recording application deduces that the user intendsto use the push to talk communication mode, provides an indication thatthe push to talk communication mode is active, and sends an indicationto the user to start recording the audio message. In order to terminatethe recording, the user only needs to release the initiation button.

Different examples are disclosed below where a user activates aninterface element, also referred to as a “button”, and the adaptiverecording application selects a communication mode for recording theaudio message based on a combination of the comparison of the durationof the activation of the button with the critical time and thedetermination of a presence or an absence of an audio signal duringand/or after the activation of the button. In an example, when the userpresses a button on the communication device for a short duration oftime less than the critical time and remains silent while pressing thebutton, the adaptive recording application selects the tap to startcommunication mode. In another example, when the user presses a buttonfor a short duration of time less than the critical time, remains silentwhile pressing the button for some time after pressing the button, andthen speaks, the adaptive recording application selects the tap to startcommunication mode. In another example, when the user presses a buttonfor a short duration of time less than the critical time, and thenspeaks while pressing the button, the adaptive recording applicationstill selects the tap to start communication mode.

In another example, when the user presses a button for a long durationof time greater than the critical time and remains silent while pressingthe button, the adaptive recording application selects the push to talkcommunication mode. In another example, when the user presses a buttonfor a long duration of time greater than the critical time, remainssilent while pressing the button and then speaks after releasing thebutton, the adaptive recording application selects the push to talkcommunication mode. In another example, when the user presses a buttonfor a long duration of time greater than the critical time and speakswhile pressing the button, the adaptive recording application stillselects the push to talk communication mode.

FIG. 3 exemplarily illustrates a flowchart comprising the steps fordetermining a communication mode for recording an audio message on acommunication device based on a determination of a presence or anabsence of an audio signal. In an example, the adaptive recordingapplication detects activation of at least two interface elements, forexample, a hardware or software implemented tactile interface elementand an audio interface element. The audio interface element, forexample, a microphone is capable of capturing an audio signal andproviding the audio signal to the adaptive recording application. Thetactile interface element is capable of capturing an application ofpressure on the tactile interface element. The user may click on thetactile interface element to initiate recording of an audio message andactivate the audio interface element to terminate the recording using avoice command.

The adaptive recording application provides the user with a defaultscreen 301 showing recording controls on the communication device. Theadaptive recording application detects an activation of the tactileinterface element, for example, a click on an initiation button 302 onthe user's communication device and initiates 303 recording of an audiomessage. The adaptive recording application then checks 304 whether theuser is talking while pressing the initiation button by detectingpresence or absence of an audio signal. If the adaptive recordingapplication detects an activation of the tactile interface element anddetermines a presence of an audio signal, the adaptive recordingapplication selects the push to talk communication mode 305 forrecording the audio message. That is, if the user presses down on theinitiation button and starts talking while the initiation button ispressed, the adaptive recording application recognizes that words arebeing spoken while the initiation button is pressed down, and deducesthat the user intends to use the push to talk communication mode. As theuser talks 306, the audio message is recorded in the push to talkcommunication mode on the communication device. After completing theaudio message, the user lets go off or releases 307 the initiationbutton on the communication device, which triggers termination 308 ofthe recording. The adaptive recording application saves 309 the recordedaudio message and provides an indication 310 to the user that therecorded audio message is saved.

If the adaptive recording application determines an absence of an audiosignal during the activation of the tactile interface, the adaptiverecording application selects the tap to start communication mode 311for recording the audio message. That is, if there is a silence or alack of audible sounds such as music, natural sounds such as birdschirping, etc., while the user is pressing the initiation button, theadaptive recording application deduces that the user intends to use thetap to start communication mode. After the user releases the finger fromthe initiation button, that is, when the user is no longer pressing theinitiation button, the adaptive recording application provides anindication 312 to the user to start talking. As the user talks 306, theaudio message is recorded in the tap to start communication mode on thecommunication device. When the user stops talking 314 or taps 313 a stopbutton, the adaptive recording application detects an end of therecording and triggers termination 308 of the recording. The adaptiverecording application saves 309 the recorded audio message and providesan indication 310 to the user that the recorded audio message is saved.This embodiment eliminates problems a user may face on deciding andswitching between the communication modes while recording an audiomessage on a portable communication device, and allows the user toseamlessly use either of the communication modes for any utterance oraudible sounds. Furthermore, this embodiment provides more flexibilityto the user by relying on voice activity detection rather than just thepassage of time to determine what communication mode the user wants touse. The adaptive recording application performs voice activitydetection, for example, using speech recognition algorithms of NuanceCommunications Inc., at&t Inc., Google Inc., etc., and other techniquesthat are well known in the art.

FIG. 4 exemplarily illustrates a timing diagram showing a lowerthreshold time (L) and an upper threshold time (U) configured fordetermining a communication mode for recording a media message on acommunication device. In this embodiment, the configurable timeparameters comprise the lower threshold time and the upper thresholdtime. As used herein, the term “lower threshold time” refers to a timebefore which it is explicitly clear that the user's intention is to usethe tap to start communication mode based on the fact that the interfaceelement is held down for a short time. Also, as used herein, the term“upper threshold time” refers to a time after which it is explicitlyclear that the user intends to use the push to talk communication modebased on the fact that the user is holding the interface element downfor a long time. The timing diagram exemplarily illustrated in FIG. 4shows the lower threshold time being set to L seconds and the upperthreshold time being set to U seconds. In this embodiment, the adaptiverecording application compares the duration of the activation of theinterface element with the lower threshold time and the upper thresholdtime as disclosed in the detailed description of FIGS. 7A-7C.

The adaptive recording application selects one of the communicationmodes, for example, the tap to start communication mode if the durationof the activation of the interface element is less than the lowerthreshold time, and the other communication mode, for example, the pushto talk communication mode if the duration of the activation of theinterface element is greater than the upper threshold time. For example,if the user clicks a button for less than the lower threshold time, L,the adaptive recording application determines the click to be a tapbecause of the extremely short duration of the click and the adaptiverecording application then determines that the user is intending to usethe tap to start communication mode. If the user clicks a button for aduration of time greater than the upper threshold time, U, the adaptiverecording application interprets the button to be held for an extendedduration of time and then determines that the user is intending to usethe push to talk communication mode. If the user clicks on the buttonfor a duration of time between the lower threshold time L and the upperthreshold time U, the adaptive recording application determines theintent of the user using a hybrid method disclosed in the detaileddescription of FIGS. 7A-7C.

In an embodiment, if the duration of the activation of the interfaceelement is greater than the lower threshold time and less than the upperthreshold time, the adaptive recording application selects one of thecommunication modes based on one or more of multiple options comprising,for example, a presence or an absence of a media signal, proximity ofthe duration of the activation of the interface element to the lowerthreshold time or the upper threshold time, that is, which among thelower threshold time and the upper threshold time the tap or clicklength is closer to, type of the user's communication device, a userinput clarifying the communication mode intended by the user, etc. Theoptions comprise any method for selecting a communication mode. In anembodiment, the adaptive recording application provides a dialog boxaccessible on the user's communication device that prompts the user toselect which communication mode he/she would like to use.

FIG. 5 exemplarily illustrates a flow diagram comprising the steps forconfiguring a lower threshold time for determining a communication modefor recording a media message on a communication device. Consider anexample where the lower threshold time is configured at L seconds by auser input, a system setting, etc. The adaptive recording applicationdetects activation of an interface element on the communication device.That is, the adaptive recording application detects 501 a press or aclick of a button. After detection of the click of the button, theadaptive recording application measures 502 the length or duration ofthe user's click, if the adaptive recording application selects thecommunication mode to be the tap to start communication mode. Theadaptive recording application then updates 503 the calculation of thelower threshold time by running the length of the latest user's clickthrough a statistical measure to calculate the new lower threshold time.The adaptive recording application then sets 504 the updated value asthe new lower threshold time.

Consider an example where the lower threshold time is denoted by LTT.Consider t₁, t₂, . . . , t_(n), to be the set containing the length ofthe activation of the interface element during each user's interactionsin which the tap to start communication mode was selected to be used bythe adaptive recording application on the communication device, where“t₁” corresponds to the user's first interaction and each subsequentinteraction corresponds to the next element in the set. Consider LTT₀ tobe the lower threshold time initially set at the start before the userhas performed any click. LTT₀ is initialized, for example, using asystem setting. Consider LTT_(n) to be the lower threshold time afterthe user has performed “n” clicks in which the tap to startcommunication mode was selected to be used by the adaptive recordingapplication on the communication device. LTT_(n) may be calculated byany statistical measure from LTT₀, t₁, . . . , t_(n).

Enumerated below are examples of the statistical measures used forcalculating LTT_(n). In a first example, LTT_(n) can be the average ofall elements in the set containing the lengths of each of the user'sclicks when the communication mode selected by the adaptive recordingapplication is the tap to start communication mode, as shown below:

LTT _(n)=avg(LTT ₀ ,t ₁ , . . . ,t _(n))

In an second example, if “n” is greater than a number such as 5, LTT_(n)can be the average of all elements in the set containing the lengths ofeach of the user's clicks when the communication mode selected by theadaptive recording application is the tap to start communication mode,excluding the set's maximum and minimum. In a third example, LTT_(n) canbe the maximum of all elements in the set containing the lengths of eachof the user's clicks when the communication mode selected by theadaptive recording application is the tap to start communication mode,as shown below:

LTT _(n)=max(LTT ₀ ,t ₁ , . . . ,t _(n))

In a fourth example, LTT_(n) can be the minimum of all elements in theset containing the lengths of each of the user's clicks when thecommunication mode selected by the adaptive recording application is thetap to start communication mode, as shown below:

LTT _(n)=min(LTT ₀ ,t ₁ , . . . ,t _(n))

FIG. 6 exemplarily illustrates a flow diagram comprising the steps forconfiguring an upper threshold time for determining a communication modefor recording a media message on a communication device. Consider anexample where the upper threshold time is configured at U seconds by auser input, a system setting, etc. The adaptive recording applicationdetects activation of an interface element on the communication device.That is, the adaptive recording application detects 601 the beginning ofa press or click of a button. After detection of the beginning of theactivation of interface element, the adaptive recording applicationmeasures 602 the length of time between the beginning of the user's mostrecent click and when the user begins speaking, for the case when theadaptive recording application selects the communication mode to be thepush to talk communication mode. The adaptive recording application thenupdates 603 the calculation of the upper threshold time by running thelength between the beginning of the user's most recent click and whenthe user begins speaking through a statistical measure to calculate thenew upper threshold time. The adaptive recording application then sets604 the updated value as the new upper threshold time.

Consider an example where the upper threshold time is denoted by UTT.Consider t₁, t₂, . . . , t_(n), to be the set containing the lengthsbetween the beginning of the activation of the interface and when theuser first starts speaking, for the cases when the push to talkcommunication mode is selected by the adaptive recording application,where “t₁” corresponds to the user's first interaction and eachsubsequent interaction corresponds to the next element in the set.Consider UTT₀ to be the upper threshold time initially set at the startbefore the user has performed any click. UTT₀ is initialized, forexample, using a system setting. Consider UTT_(n) to be the upperthreshold time after the user has performed “n” interactions in whichthe push to talk communication mode was selected to be used by theadaptive recording application. UTT_(n) may be calculated by anystatistical measure from UTT₀, t₁, . . . , t_(n).

Enumerated below are examples of the statistical measures used forcalculating UTT_(n). In a first example, UTT_(n) can be the average ofall elements in the set, as shown below:

UTT _(n)=avg(LTT ₀ ,t ₁ , . . . ,t _(n))

In a second example, if “n” is greater than a number such as 5, UTT_(n)can be the average of all elements in the set, excluding the set'smaximum and minimum. In a third example, UTT_(n) can the maximum of allelements in the set, as shown below:

UTT _(n)=max(LTT ₀ ,t ₁ , . . . ,t _(n))

In a fourth example, UTT_(n) can be the minimum of all elements in theset, as shown below:

UTT _(n)=min(LTT ₀ ,t ₁ , . . . ,t _(n))

In an embodiment, the adaptive recording application engages the user ina training session for adaptively configuring the lower threshold timeand the upper threshold time for determining a communication mode forrecording a media message on a communication device. The configurationof the lower threshold time and the upper threshold time is customizedbased on how the user selects the communication modes. The adaptiverecording application instructs the user to repeatedly press and releasethe interface element while intending to use the push to talkcommunication mode. The adaptive recording application then instructsthe user to repeatedly press and release the interface element whileintending to use the tap to start communication mode. The adaptiverecording application then sets the lower threshold time and the upperthreshold time based on an average formula or another statisticalmeasure. For example, during the training session of the push to talkcommunication mode if the user required 1.0 second to press and releasethe interface element for the first trial, 1.0 second to press andrelease the interface element for the second trial, and 2.0 seconds topress and release the interface element for the third trial, then theadaptive recording application sets the upper threshold time to 1.33seconds. In another example, during the training session of the tap tostart communication mode if the user required 0.5 seconds to press andrelease the interface element for the first trial, 0.5 seconds to pressand release the interface element for the second trial, and 0.3 secondsto press and release the interface element for the third trial, then theadaptive recording application sets the lower threshold time to 0.43seconds.

In another embodiment, the lower threshold time and the upper thresholdtime can be used to calculate the critical time that is used to selectthe communication mode for recording a media message, for example, anaudio message disclosed in the detailed description of FIG. 2. In thisembodiment, the critical time is a time that falls between the upperthreshold time and the lower threshold time. For example, the criticaltime can be the average of the upper threshold time and the lowerthreshold time. In another example, the critical time can be the thirdquartile or the fourth decile of the upper threshold time and the lowerthreshold time. In an embodiment, the upper threshold time and the lowerthreshold time can be directly used by a hybrid method as disclosed inthe detailed description of FIGS. 7A-7C, for determining thecommunication mode for initiating recording of a media message.

FIGS. 7A-7C exemplarily illustrate a flowchart comprising the steps fordetermining a communication mode based on a comparison of the durationof the activation of the interface element with a lower threshold timeand an upper threshold time. The adaptive recording application providesthe user with a default screen showing recording controls on thecommunication device. The adaptive recording application detects anactivation of the interface element, for example, a click 701 on aninitiation button on the user's communication device. The adaptiverecording application initiates 702 recording of an audio message andsets 703 the communication mode to the push to talk communication modeso that any audio message starts recording. The adaptive recordingapplication then detects 704 a release of a click on the initiationbutton. The adaptive recording application compares the duration of theclick with the lower threshold time and the upper threshold time. Forexample, the adaptive recording application checks 705 if the durationof the click is less than the lower threshold time. If the duration ofthe click is less than the lower threshold time, the adaptive recordingapplication sets 706 the communication mode to the tap to startcommunication mode for recording the audio message. As the user talks710, the adaptive recording application detects an end of the recordingwhen the user stops talking 713 or taps 712 a stop button exemplarilyillustrated in FIG. 7B, and triggers termination 714 of the recording.The adaptive recording application saves 715 the recorded audio messageand provides an indication 716 to the user that the recorded audiomessage is saved as exemplarily illustrated in FIG. 7C.

If the duration of the click is not less than the lower threshold, theadaptive recording application checks 707 whether the duration of theclick is greater than the upper threshold time. If the duration of theclick is greater than the upper threshold time, the adaptive recordingapplication continues to record the audio, until the user releases theinitiation button, at which time the adaptive recording applicationtriggers termination 714 of the recording. The adaptive recordingapplication saves 715 the recorded audio message and provides anindication 716 to the user that the recorded audio message is saved. Ifthe duration of the click is less than the upper threshold time andgreater than the lower threshold time, then the adaptive recordingapplication checks 708 whether the user is talking after the initiationbutton is released. If the user is talking after the initiation buttonis released, the adaptive recording application sets 706 thecommunication mode to the tap to start communication mode. The usertalks 710 and the adaptive recording application detects an end of therecording when the user stops talking 713 or taps 712 a stop button, andtriggers termination 714 of the recording. The adaptive recordingapplication saves 715 the recorded audio message and provides anindication 716 to the user that the recorded audio message is saved. Ifthe user is not talking after the initiation button is released, thenthe adaptive recording application checks 709 whether the user wastalking while the initiation button was pressed. If the user was talkingwhile the button was pressed, then the adaptive recording applicationcontinues in the push to talk communication mode. The user talks 710 andreleases the initiation button 711 to trigger the termination 714 of therecording. The adaptive recording application saves 715 the recordedaudio message and provides an indication 716 to the user that therecorded audio message is saved. If the user is not talking during thetime the initiation button was pressed, the adaptive recordingapplication sets 706 the communication mode to the tap to startcommunication mode. The user talks 710 and the adaptive recordingapplication detects an end of the recording when the user stops talking713 or taps 712 a stop button, and triggers termination 714 of therecording. The adaptive recording application saves 715 the recordedaudio message and provides an indication 716 to the user that therecorded audio message is saved.

Consider an example, where a lower threshold time is set to 0.2 secondsand an upper threshold time is set to 0.4 seconds. In the hybrid method,for a click time less than 0.2 seconds, the adaptive recordingapplication determines the user's click to be a tap because of the shortduration of the click and determines that the user is intending to usethe tap to start communication mode. For a click time greater than 0.4seconds, the adaptive recording application determines that the user isholding the button down for an extended duration of time and thereforeintends to use the push to talk communication mode. When a user keeps abutton pressed for a time between 0.2 seconds and 0.4 seconds, since theintent of the user may be unclear, the adaptive recording applicationchecks whether the user is talking or not after the release of thebutton. If the user is talking after the release of the button, theadaptive recording application determines that the user wishes tocontinue talking and sets the communication mode to the tap to startcommunication mode. If the user is not talking when the button isreleased, then the adaptive recording application checks the recordingduring the click duration to determine the communication mode. If theuser was talking during the click duration, then the adaptive recordingapplication continues with the push to talk communication mode. If theuser was not talking during the click duration, then the adaptiverecording application sets the communication mode to the tap to startcommunication mode.

FIG. 8 exemplarily illustrates a computer implemented system 800 fordetermining a communication mode for recording a media message on acommunication device 801. The communication mode is, for example, a pushto talk communication mode, a tap to start communication mode, etc. Thecommunication device 801 is, for example, a cellular phone, a smartphone, a tablet computing device, a laptop, etc., or any other mobiledevice configured for mobile communication. The computer implementedsystem 800 disclosed herein comprises the adaptive recording application802 deployed on a user's communication device 801. The computerimplemented system 800 disclosed herein further comprises at least oneprocessor configured to execute modules 803, 804, 805, 806, 807, 808,809, etc., of the adaptive recording application 802, and anon-transitory computer readable storage medium configured to store themodules 803, 804, 805, 806, 807, 808, 809, etc., of the adaptiverecording application 802. The non-transitory computer readable storagemedium is communicatively coupled to the processor. The adaptiverecording application 802 comprises a detection module 803, anactivation duration determination module 804, a comparison module 805, amedia signal determination module 806, a mode selection module 807, atime configuration module 808, and a record trigger module 809.

The detection module 803 detects activation of an interface element onthe communication device 801 for initiating recording of the mediamessage. The activation duration determination module 804 determines aduration of the activation of the interface element. The comparisonmodule 805 compares the duration of the activation of the interfaceelement with one or more configurable timing parameters, for example, acritical time, a lower threshold time, an upper threshold time, etc. Inan embodiment, the comparison module 805 compares the duration of theactivation of the interface element with the critical time. In anotherembodiment, the comparison module 805 compares the duration of theactivation of the interface element with the lower threshold time andthe upper threshold time. The mode selection module 807 selects, inresponse to the detection of the activation of the interface element,one of the communication modes for recording the media message based onone or a combination of the comparison of the duration of the activationof the interface element with the configurable timing parametersperformed by the comparison module 805 and a determination of a presenceor an absence of a media signal during and/or after the activation ofthe interface element performed by the media signal determination module806. In an embodiment, the mode selection module 807 selects one of thecommunication modes, for example, the tap to start communication mode,if the duration of the activation of the interface element is less thanthe critical time, and another of the communication modes, for example,the push to talk communication mode if the duration of the activation ofthe interface element is greater than the critical time. In anotherembodiment, the mode selection module 807 selects one of thecommunication modes, for example, the tap to start communication mode ifthe duration of the activation of the interface element is less than thelower threshold time, and another of the communication modes, forexample, the push to talk communication mode if the duration of theactivation of the interface element is greater than the upper thresholdtime. In another embodiment, the mode selection module 807 selects oneof the communication modes based on one or more of multiple options, ifthe duration of activation of the interface element is greater than thelower threshold time and less than the upper threshold time. The optionscomprise, for example, the presence or the absence of a media signal,proximity of the duration of activation of the interface element to thelower threshold time or the upper threshold time, type of thecommunication device 801, a user input clarifying the communication modeintended by the user, etc.

The time configuration module 808 configures the configurable timingparameters, for example, based on a user input, a system settingconfigured by the adaptive recording application 802, an analysis of theduration of the activation of the interface element, and/or past userbehavior determined from prior activations of the interface element bythe user in the past, or during a training session. The media signaldetermination module 806 determines the presence or the absence of themedia signal during and/or after the activation of the interfaceelement. Furthermore, the media signal determination module 806 analyzesthe media signal received during and/or after the activation of theinterface element for selection of one of the communication modes. Therecord trigger module 809 triggers the recording of the media messageusing the selected communication mode.

FIG. 9 exemplarily illustrates the architecture of a computer system 900employed by the adaptive recording application 802 for determining acommunication mode for recording a media message on a communicationdevice 801. The adaptive recording application 802 of the computerimplemented system 800 exemplarily illustrated in FIG. 8 employs thearchitecture of the computer system 900 exemplarily illustrated in FIG.9. The computer system 900 is programmable using a high level computerprogramming language. The computer system 900 may be implemented usingprogrammed and purposeful hardware. The computer system 900 comprises,for example, a processor 901, a memory unit 902 for storing programs anddata, an input/output (I/O) controller 903, a network interface 904, adata bus 905, a display unit 906, input devices 907, a fixed media drive908, a removable media drive 909 for receiving removable media, outputdevices 910, etc.

The term “processor” refers to any one or more microprocessors, centralprocessing unit (CPU) devices, finite state machines, computers,microcontrollers, digital signal processors, logic, a logic device, anelectronic circuit, an application specific integrated circuit (ASIC), afield-programmable gate array (FPGA), a chip, etc., or any combinationthereof, capable of executing computer programs or a series of commands,instructions, or state transitions. The processor 901 may also beimplemented as a processor set comprising, for example, a generalpurpose microprocessor and a math or graphics co-processor. Theprocessor 901 is selected, for example, from the Intel® processors suchas the Itanium® microprocessor or the Pentium® processors, AdvancedMicro Devices (AMD®) processors such as the Athlon® processor,UltraSPARC® processors, microSPARC® processors, hp® processors,International Business Machines (IBM®) processors such as the PowerPC®microprocessor, the MIPS® reduced instruction set computer (RISC)processor of MIPS Technologies, Inc., RISC based computer processors ofARM Holdings, Motorola® processors, etc. The computer implemented system800 disclosed herein is not limited to the computer system 900 employingthe processor 901. The computer system 900 may also employ a controlleror a microcontroller.

The memory unit 902 is used for storing programs, applications, anddata. For example, the detection module 803, the activation durationdetermination module 804, the comparison module 805, the media signaldetermination module 806, the mode selection module 807, the timeconfiguration module 808, the record trigger module 809, etc., of theadaptive recording application 802 are stored in the memory unit 902 ofthe computer system 900. The memory unit 902 is, for example, a randomaccess memory (RAM) or another type of dynamic storage device thatstores information and instructions for execution by the processor 901.The memory unit 902 also stores temporary variables and otherintermediate information used during execution of the instructions bythe processor 901. The computer system 900 further comprises a read onlymemory (ROM) or another type of static storage device that stores staticinformation and instructions for the processor 901.

The I/O controller 903 controls input actions and output actionsperformed by the adaptive recording application 802. The networkinterface 904 enables connection of the computer system 900 to anetwork, for example, the internet, a mobile communication network, etc.For example, the computer system 900 communicates with other interactingdevices, for example, another mobile device (not shown), through thenetwork interface 904. In an embodiment, the network interface 904 isprovided as an interface card also referred to as a line card. Thenetwork interface 904 comprises, for example, one or more of an infrared(IR) interface, an interface implementing WiFi® of the Wireless EthernetCompatibility Alliance, Inc., a universal serial bus (USB) interface, aFireWire® interface of Apple, Inc., an Ethernet interface, a frame relayinterface, a cable interface, a digital subscriber line (DSL) interface,a token ring interface, a peripheral controller interconnect (PCI)interface, a local area network (LAN) interface, a wide area network(WAN) interface, interfaces using serial protocols, interfaces usingparallel protocols, and Ethernet communication interfaces, asynchronoustransfer mode (ATM) interfaces, a high-speed serial interface (HSSI), afiber distributed data interface (FDDI), interfaces based ontransmission control protocol (TCP)/internet protocol (IP), interfacesbased on wireless communications technology such as satellitetechnology, radio frequency (RF) technology, near field communication,etc. The data bus 905 permits communication between the modules, forexample, 803, 804, 805, 806, 807, 808, 809, etc., of the adaptiverecording application 802.

The display unit 906 via a graphical user interface (GUI), displaysinformation, display interfaces, user interface elements such as textfields, checkboxes, text boxes, windows, etc., for allowing the user,for example, to record and send a media message. The display unit 906comprises, for example, a liquid crystal display, a plasma display, anorganic light emitting diode (OLED) based display, etc. The inputdevices 907 are used for inputting data, for example, a media message,into the computer system 900. The input devices 907 are, for example, amicrophone for inputting voice, a physical button, a touch pad, a touchsensitive display device or any device capable of sensing a tactileinput, etc. The input devices 907 further comprise a keyboard such as analphanumeric keyboard for entering a text message, a joystick, apointing device such as a computer mouse, a light pen, a pointingdevice, a track ball, a pointing stick, etc.

Computer applications and programs are used for operating the computersystem 900. The programs are loaded onto the fixed media drive 908 andinto the memory unit 902 of the computer system 900 via the removablemedia drive 909. In an embodiment, the computer applications andprograms may be loaded directly via a network, for example, a Wi-Fi®network. Computer applications and programs are executed by doubleclicking a related icon displayed on the display unit 906 using one ofthe input devices 907. The output devices 910 output the results of theoperations performed by the adaptive recording application 802, forexample, for viewing by a user of the communication device 801. Forexample, the adaptive recording application 802 notifies a user of aselected communication mode, about the recorded media message generatedon successful recording of the media message, etc., through a pop-upwindow on the output device 910 such as the display unit 906 of thecommunication device 801. In another example, the adaptive recordingapplication 802 prompts the user to select which communication modehe/she would like to use through a dialog box on the output device 910such as the display unit 906 of the communication device 801.

The processor 901 executes an operating system, for example, the Linux®operating system, the Unix® operating system, any version of theMicrosoft® Windows® operating system, the Mac OS of Apple Inc., the IBM®OS/2, VxWorks® of Wind River Systems, inc., QNX Neutrino® developed byQNX Software Systems Ltd., Palm OS®, the Solaris operating systemdeveloped by Sun Microsystems, Inc., the Android operating system,Windows Phone™ operating system of Microsoft Corporation, BlackBerry®operating system of Research in Motion Limited, the iOS operating systemof Apple Inc., the Symbian® operating system of Symbian FoundationLimited, etc. The computer system 900 employs an operating system forperforming multiple tasks. The operating system is responsible formanagement and coordination of activities and sharing of resources ofthe computer system 900. The operating system further manages securityof the computer system 900, peripheral devices connected to the computersystem 900, and network connections. The operating system employed onthe computer system 900 recognizes, for example, inputs provided by auser using one of the input devices 907, the output display, files, anddirectories stored locally on the fixed media drive 908, for example, ahard drive. The operating system on the computer system 900 executesdifferent programs using the processor 901. The processor 901 and theoperating system together define a computer platform for whichapplication programs in high level programming languages are written.

The processor 901 retrieves the instructions for executing the modules,for example, 803, 804, 805, 806, 807, 808, 809, etc., of the adaptiverecording application 802. A program counter determines the location ofthe instructions in the memory unit 902. The program counter stores anumber that identifies a current position in the program of each of themodules, for example, 803, 804, 805, 806, 807, 808, 809, etc., of theadaptive recording application 802. The instructions fetched by theprocessor 901 from the memory unit 902 after being processed aredecoded. The instructions are stored in an instruction register in theprocessor 901. After processing and decoding, the processor 901 executesthe instructions. For example, the detection module 803 definesinstructions for detecting activation of an interface element on thecommunication device 801 for initiating recording of a media message.The activation duration determination module 804 defines instructionsfor determining a duration of the activation of the interface element.The time configuration module 808 defines instructions for configuringthe configurable timing parameters, for example, based on a user input,a system setting configured by the adaptive recording application 802,an analysis of the duration of the activation of the interface element,and/or past user behavior determined from prior activations of theinterface element by the user in the past, or during a training session.The comparison module 805 defines instructions for comparing theduration of the activation of the interface element with one or moreconfigurable timing parameters, for example, a critical time, a lowerthreshold time, an upper threshold time, etc. In an embodiment, thecomparison module 805 defines instructions for comparing the duration ofthe activation of the interface element with the critical time. Inanother embodiment, the comparison module 805 defines instructions forcomparing the duration of the activation of the interface element withthe lower threshold time and the upper threshold time.

The media signal determination module 806 defines instructions fordetermining the presence or the absence of a media signal during and/orafter the activation of the interface element. In an embodiment, themedia signal determination module 806 defines instructions for analyzingthe media signal received during and/or after the activation of theinterface element for the selection of one of the communication modes.The record trigger module 809 defines instructions for triggering therecording of the media message using the selected communication mode.The mode selection module 807 defines instructions for selecting, inresponse to the detection of the activation of the interface element onthe communication device 801, one of multiple communication modes forrecording the media message based on one or a combination of thecomparison of the duration of the activation of the interface elementwith one or more configurable timing parameters and the determination ofa presence or an absence of a media signal during and/or after theactivation of the interface element.

In an embodiment, the mode selection module 807 defines instructions forselecting one of the communication modes, for example, the tap to startcommunication mode, if the duration of the activation of the interfaceelement is less than the critical time, and another of the communicationmodes, for example, the push to talk communication mode if the durationof the activation of the interface element is greater than the criticaltime. In another embodiment, the mode selection module 807 definesinstructions for selecting one of the communication modes, for example,the tap to start communication mode, if the duration of the activationof the interface element is less than the lower threshold time, andanother of the communication modes, for example, the push to talkcommunication mode if the duration of the activation of the interfaceelement is greater than the upper threshold time. In another embodiment,the mode selection module 807 defines instructions for selecting one ofthe communication modes based on one or more options comprising, forexample, the presence or the absence of the media signal, proximity ofthe duration of activation of the interface element to the lowerthreshold time or the upper threshold time, type of the communicationdevice 801, a user input clarifying the communication mode intended bythe user, etc., if the duration of activation of the interface elementis greater than the lower threshold time and less than the upperthreshold time.

The processor 901 of the computer system 900 employed by the adaptiverecording application 802 retrieves the instructions defined by thedetection module 803, the activation duration determination module 804,the comparison module 805, the media signal determination module 806,the mode selection module 807, the time configuration module 808, therecord trigger module 809, etc., of the adaptive recording application802 and executes the instructions, thereby performing one or moreprocesses defined by those instructions.

At the time of execution, the instructions stored in the instructionregister are examined to determine the operations to be performed. Theprocessor 901 then performs the specified operations. The operationscomprise arithmetic operations and logic operations. The operatingsystem performs multiple routines for performing a number of tasksrequired to assign the input devices 907, the output devices 910, andmemory for execution of the modules, for example, 803, 804, 805, 806,807, 808, 809, etc., of the adaptive recording application 802. Thetasks performed by the operating system comprise, for example, assigningmemory to the modules, for example, 803, 804, 805, 806, 807, 808, 809,etc., of the adaptive recording application 802, and to data used by theadaptive recording application 802, moving data between the memory unit902 and disk units, and handling input/output operations. The operatingsystem performs the tasks on request by the operations and afterperforming the tasks, the operating system transfers the executioncontrol back to the processor 901. The processor 901 continues theexecution to obtain one or more outputs. The outputs of the execution ofthe modules, for example, 803, 804, 805, 806, 807, 808, 809, etc., ofthe adaptive recording application 802 are displayed to the user on thedisplay unit 906.

For purposes of illustration, the detailed description refers to theadaptive recording application 802 disclosed herein being run locally onthe computer system 900; however the scope of the computer implementedmethod and system 800 disclosed herein is not limited to the adaptiverecording application 802 being run locally on the computer system 900via the operating system and the processor 901, but may be extended torun remotely over the network, for example, by employing a web browserand a remote server, a mobile phone, or other electronic devices. One ormore portions of the computer system 900 may be distributed across oneor more computer systems (not shown) coupled to the network.

Disclosed herein is also a computer program product comprising anon-transitory computer readable storage medium that stores computerprogram codes comprising instructions executable by at least oneprocessor 901 of the computer system 900 for determining a communicationmode for recording a media message on a communication device 801. Asused herein, the term “non-transitory computer readable storage medium”refers to all computer readable media, for example, non-volatile mediasuch as optical discs or magnetic disks, volatile media such as aregister memory, a processor cache, etc., and transmission media such aswires that constitute a system bus coupled to the processor 901, exceptfor a transitory, propagating signal.

The computer program product disclosed herein comprises multiplecomputer program codes for determining a communication mode forrecording a media message on a user's communication device 801. Forexample, the computer program product disclosed herein comprises a firstcomputer program code for detecting activation of an interface elementon the communication device 801 for initiating the recording of themedia message; a second computer program code for determining a durationof the activation of the interface element; a third computer programcode for comparing the duration of the activation of the interfaceelement with one or more configurable timing parameters; and a fourthcomputer program code for selecting, in response to the detection of theactivation of the interface element, one of multiple communication modesfor recording the media message based on one or a combination of thecomparison of the duration of the activation of the interface elementwith one or more configurable timing parameters and the determination ofa presence or an absence of a media signal during and/or after theactivation of the interface element.

The computer program product disclosed herein further comprises a fifthcomputer program code for configuring the configurable timingparameters, for example, based on a user input, a system settingconfigured by the adaptive recording application 802, an analysis of theduration of the activation of the interface element, and/or past userbehavior determined from the activation of the interface element by theuser in the past, or during a training session. The computer programproduct disclosed herein further comprises a sixth computer program codefor analyzing the media signal received during and/or after theactivation of the interface element for the selection of one of thecommunication modes. The computer program product disclosed hereinfurther comprises one or more additional computer program codes forperforming additional steps that may be required and contemplated fordetermining a communication mode for recording the media message on thecommunication device 801. In an embodiment, a single piece of computerprogram code comprising computer executable instructions performs one ormore steps of the computer implemented method disclosed herein fordetermining a communication mode for recording the media message on thecommunication device 801.

The computer program codes comprising the computer executableinstructions are embodied on the non-transitory computer readablestorage medium. The processor 901 of the computer system 900 retrievesthese computer executable instructions and executes them. When thecomputer executable instructions are executed by the processor 901, thecomputer executable instructions cause the processor 901 to perform thesteps of the computer implemented method for determining a communicationmode for recording the media message on the communication device 801.

It will be readily apparent that the various methods, algorithms, andcomputer programs disclosed herein may be implemented on computerreadable media appropriately programmed for general purpose computers,computing devices, and communication devices. As used herein, the term“computer readable media” refers to non-transitory computer readablemedia that participate in providing data, for example, instructions thatmay be read by a computer, a processor or a similar device.Non-transitory computer readable media comprise all computer readablemedia, for example, non-volatile media, volatile media, and transmissionmedia, except for a transitory, propagating signal. Non-volatile mediacomprise, for example, optical discs or magnetic disks and otherpersistent memory volatile media including a dynamic random accessmemory (DRAM), which typically constitutes a main memory. Volatile mediacomprise, for example, a register memory, a processor cache, a randomaccess memory (RAM), etc. Transmission media comprise, for example,coaxial cables, copper wire, fiber optic cables, modems, etc., includingwires that constitute a system bus coupled to a processor, etc. Commonforms of computer readable media comprise, for example, a floppy disk, aflexible disk, a hard disk, magnetic tape, a laser disc, a Blu-rayDisc®, any magnetic medium, a compact disc-read only memory (CD-ROM), adigital versatile disc (DVD), any optical medium, a flash memory card,punch cards, paper tape, any other physical medium with patterns ofholes, a random access memory (RAM), a programmable read only memory(PROM), an erasable programmable read only memory (EPROM), anelectrically erasable programmable read only memory (EEPROM), a flashmemory, any other memory chip or cartridge, or any other medium fromwhich a computer can read.

The computer programs that implement the methods and algorithmsdisclosed herein may be stored and transmitted using a variety of media,for example, the computer readable media in a number of manners. In anembodiment, hard-wired circuitry or custom hardware may be used in placeof, or in combination with, software instructions for implementation ofthe processes of various embodiments. Therefore, the embodiments are notlimited to any specific combination of hardware and software. Ingeneral, the computer program codes comprising computer executableinstructions may be implemented in any programming language. Someexamples of programming languages that can be used comprise C, C++, C#,Java®, Fortran, Ruby, Pascal, Perl®, Python®, Visual Basic®, MATLAB®,etc. Other object-oriented, functional, scripting, and/or logicalprogramming languages may also be used. The computer program codes orsoftware programs may be stored on or in one or more mediums as objectcode. Various aspects of the method and system disclosed herein may beimplemented as programmed elements, or non-programmed elements, or anysuitable combination thereof. The computer program product disclosedherein comprises computer executable instructions embodied in anon-transitory computer readable storage medium, wherein the computerprogram product comprises one or more computer program codes forimplementing the processes of various embodiments.

The present invention can be configured to work in a network environmentcomprising one or more computers that are in communication with one ormore devices via a network. The computers may communicate with thedevices directly or indirectly, via a wired medium or a wireless mediumsuch as the Internet, a local area network (LAN), a wide area network(WAN) or the Ethernet, a token ring, or via any appropriatecommunications mediums or combination of communications mediums. Each ofthe devices may comprise processors, for example, the Intel® processors,Advanced Micro Devices (AMD®) processors, UltraSPARC® processors, Hp®processors, International Business Machines (IBM®) processors, RISCbased computer processors of ARM Holdings, Motorola® processors, etc.,that are adapted to communicate with the computers. In an embodiment,each of the computers is equipped with a network communication device,for example, a network interface card, a modem, or other networkconnection device suitable for connecting to a network. Each of thecomputers and the devices executes an operating system, for example, theLinux® operating system, the Unix® operating system, any version of theMicrosoft® Windows® operating system, the Mac OS of Apple Inc., the IBM®OS/2, the Palm OS®, the Android® OS, the Blackberry® OS, the Solarisoperating system developed by Sun Microsystems, Inc., or any otheroperating system. Handheld devices execute operating systems, forexample, the Android operating system, the Windows Phone™ operatingsystem of Microsoft Corporation, the BlackBerry® operating system ofResearch in Motion Limited, the iOS operating system of Apple Inc., theSymbian® operating system of Symbian Foundation Limited, etc. While theoperating system may differ depending on the type of computer, theoperating system will continue to provide the appropriate communicationsprotocols to establish communication links with the network. Any numberand type of machines may be in communication with the computers. Thepresent invention is not limited to a particular computer systemplatform, processor, operating system, or network.

The foregoing examples have been provided merely for the purpose ofexplanation and are in no way to be construed as limiting of the presentinvention disclosed herein. While the invention has been described withreference to various embodiments, it is understood that the words, whichhave been used herein, are words of description and illustration, ratherthan words of limitation. Further, although the invention has beendescribed herein with reference to particular means, materials, andembodiments, the invention is not intended to be limited to theparticulars disclosed herein; rather, the invention extends to allfunctionally equivalent structures, methods and uses, such as are withinthe scope of the appended claims. Those skilled in the art, having thebenefit of the teachings of this specification, may affect numerousmodifications thereto and changes may be made without departing from thescope and spirit of the invention in its aspects.

I claim:
 1. A computer implemented method for determining acommunication mode, comprising: detecting activation of an interfaceelement on a user communication device; measuring duration of saidactivation of said interface element; comparing said measured durationwith a critical time; selecting a tap to start communication mode whensaid measured duration is less than or equal to said critical time;selecting a push to start communication mode when said measured durationis greater than said critical time; and providing an indication of saidselected communication mode on said user communication device.
 2. Thecomputer implemented method of claim 1, wherein said critical time isset to distinguish between said tap on said interface element and saidpress and hold on said interface element.
 3. The computer implementedmethod of claim 1, wherein said tap to start communication mode and saidpush to start communication mode is used to record a media message, andwherein said media message is one of an audio message, a video message,an audiovisual message, and a multimedia message.
 4. The computerimplemented method of claim 1, wherein said critical time is configuredbased on one of a user input, a system setting, an analysis of saidduration of said activation of said interface element, and past userbehavior determined from prior activations of said interface element. 5.A computer implemented system for determining a communication mode,comprising: a user communication device, comprising: at least oneprocessor configured to execute an application; a non-transitorycomputer readable storage medium communicatively coupled to said atleast one processor, said non-transitory computer readable storagemedium configured to store said application; a display unit configuredto display a recording interface comprising at least an interfaceelement; and said application comprising: a detection module configuredto detect user activation of said interface element on said displayunit; an activation duration detection module configured to measure aduration of said user activation of said interface element; a comparisonmodule configured to compare said measured duration with a criticaltime; and a mode selection module configured to select one of a tap tostart communication mode when said measured duration is less than orequal to said critical time, and a push to start communication mode whensaid measured duration is greater than said critical time; and saidapplication configured to provide an indication of said selectedcommunication mode on said user communication device.
 6. The computerimplemented system of claim 5, wherein said critical time is set todistinguish between said tap on said interface element and said pressand hold on said interface element.
 7. The computer implemented systemof claim 5, wherein said tap to start communication mode and said pushto start communication mode is used to record a media message, andwherein said media message is one of an audio message, a video message,an audiovisual message, and a multimedia message.
 8. The computerimplemented system of claim 5, wherein a time configuration module ofsaid application configures said critical time based on one of a userinput, a system setting, an analysis of said duration of said activationof said interface element, and past user behavior determined from prioractivations of said interface element.